- Motivation
- Inferenzstatistik
- Item Response Theorie
- Bayes’sche IRT in R
- Showroom: birtms
- Methodik: Bayes’sche IRT-Modellierung
- R-Package-Premiere: birtms
- fachdidaktische Erkenntnisse
- birtms Tutorial
- Item Response Theorie
- Interesse an Statistik
15.9.2021
| Variable | TAM | brms |
|---|---|---|
| Zeit in s | 0.06 | 138 |
| Iterationen | 36 | 4000 + 4000 + X |
| Speicher in MB | 0.46 | 34.73 |
| Werte je Parameter | 3 | 4000 |
i1_easyness <- brms::ranef(fit_1d_1pl_spm, summary = FALSE)$item[,"i1",1] head(i1_easyness, n = 48) %>% round(2)
## [1] 0.89 0.54 1.20 0.36 0.57 0.67 0.97 0.55 1.29 1.18 1.71 1.68 ## [13] 2.05 1.36 1.22 1.41 1.46 1.15 -0.26 0.26 0.43 0.10 0.42 0.71 ## [25] 0.51 0.80 1.07 1.16 1.26 0.90 0.85 0.29 0.10 -0.13 0.36 0.74 ## [37] 0.20 1.10 0.82 0.28 0.01 0.38 1.09 0.94 0.24 0.13 0.27 -0.15
i1_easyness %>% birtms::plot_ppmc_distribution()
\[P\left(1|\theta_k, \beta_i\right)=\frac{\exp\left(\theta_k + \beta_i\right)}{1+\exp\left(\theta_k + \beta_i\right)}\]
\[P\left(1|\theta_k, \beta_i\right)=\frac{\exp\left(\theta_k + \beta_i\right)}{1+\exp\left(\theta_k + \beta_i\right)}\]
\[P\left(1|\theta_k, \beta_i\right)=\frac{\exp\left(\theta_k + \beta_i\right)}{1+\exp\left(\theta_k + \beta_i\right)}\]
data_spm_long <- data_spm %>%
birtms::compose_dataset(
response_columns = i1:i12)(formula1PL <- birtms::build_formula())
## response ~ 1 + (1 | person) + (1 | item)
prior_1PL <- brms::prior("normal(0, 3)", class = "sd", group = "person") +
brms::prior("normal(0, 3)", class = "sd", group = "item")fit_1d_1pl_spm <- birtms::birtm( data = data_spm_long, formula = formula1PL, prior = prior_1PL, file = "models/fit_1d_1pl_spm" )
summary(fit_1d_1pl_spm, robust = TRUE)
## Family: bernoulli ## Links: mu = logit ## Formula: response ~ 1 + (1 | person) + (1 | item) ## Data: data (Number of observations: 5988) ## Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1; ## total post-warmup draws = 4000 ## ## Group-Level Effects: ## ~item (Number of levels: 12) ## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS ## sd(Intercept) 1.49 0.32 1.02 2.46 1.00 1193 2063 ## ## ~person (Number of levels: 499) ## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS ## sd(Intercept) 1.68 0.08 1.54 1.85 1.00 1561 2524 ## ## Population-Level Effects: ## Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS ## Intercept 1.05 0.43 0.18 2.02 1.01 682 1213 ## ## Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS ## and Tail_ESS are effective sample size measures, and Rhat is the potential ## scale reduction factor on split chains (at convergence, Rhat = 1).
bayesplot::mcmc_trace(
fit_1d_1pl_spm, pars = c("b_Intercept")
)
g_spm_2pl <- birtms::ICC_check( fit_1d_2pl_spm, post_responses = posterior_responses_spm_2pl, num_groups = 6, item_id = 12, ellipse_type = "axisparallel")
marg_loo1pl <- birtms::loo_marginal(fit_1d_1pl_spm) marg_loo2pl <- birtms::loo_marginal(fit_1d_2pl_spm)
loo::loo_compare(marg_loo1pl, marg_loo2pl)
## elpd_diff se_diff ## model2 0.0 0.0 ## model1 -25.4 11.1
ordata_1pl_spm_fullbayes %>% birtms::plot_ppmc_or_heatmap()
ordata_2pl_spm_fullbayes %>% birtms::plot_ppmc_or_heatmap()
ordata_2pl_spm_fullbayes %>%
birtms::plot_or_heatmap(
model = fit_1d_2pl_spm)ordata_2pl_spm_fullbayes %>%
birtms::plot_or_heatmap(
model = fit_1d_2pl_spm, bayesian = TRUE)fit_1d_1pl_spm %>% birtms::plot_wrightmap(classic = FALSE)
Bürkner, P.-C. (2019). Bayesian Item Response Modeling in R with brms and Stan. Verfügbar unter: https://arxiv.org/pdf/1905.09501
Bürkner, P.-C. (2020). Analysing Standard Progressive Matrices (SPM-LS) with Bayesian Item Response Models. Journal of Intelligence, 8(1). https://doi.org/10.3390/jintelligence8010005
Clark, M. (2018). Bayesian Basics. Verfügbar unter: https://m-clark.github.io/bayesian-basics/diagnostics.html
Fujimoto, K. A. & Neugebauer, S. R. (2020). A General Bayesian Multidimensional Item Response Theory Model for Small and Large Samples. Educational and psychological measurement, 80(4), 665–694. https://doi.org/10.1177/0013164419891205
Grottke, T., Möhrke, P. & Marvin, R. (2021). Statistische Analysen mit R in den MINT-Didaktiken Eine Tutorial-Sammlung: 4 Praxisorientierte Einf ührung in die Item-Response-Theorie mit dem Fokus auf das Rasch-Modell. Verfügbar unter: https://dbuschhue.github.io/P4-Worflow/praxisorientierte-einf
Haberman, S. J., Holland, P. W. & Sinharay, S. (2007). Limits on Log Odds Ratios for Unidimensional Item Response Theory Models. Psychometrika, 72(4), 551–561. https://doi.org/10.1007/S11336-007-9009-0
Hyvönen, V. H. & Tolonen, T. (2019). Chapter 3 Summarizing the posterior distribution | Bayesian Inference 2019. Verfügbar unter: https://vioshyvo.github.io/Bayesian_inference/summarizing-the-posterior-distribution.html
Kruschke, J. K. (2015). Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan (2. ed.). Amsterdam: AP Academic Press/Elsevier. Verfügbar unter: http://www.contentreserve.com/TitleInfo.asp?ID=38F45CF6-6B5C-433C-85F8-A3568420927D&Format=50
Kruschke, J. K. & Liddell, T. M. (2018). The Bayesian New Statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective. Psychonomic Bulletin & Review, 25(1), 178–206. https://doi.org/10.3758/s13423-016-1221-4
Kruschke, J. K. & Liddell, T. M. (2018). The Bayesian New Statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective. Psychonomic Bulletin & Review, 25(1), 178–206. https://doi.org/10.3758/s13423-016-1221-4
Meehl, P. E. (1997). The Problem Is Epistemology, Not Statistics: Replace Significance Tests by Confidence Intervals and Quantify Accuracy of Risky Numerical Predictions (Multivariate Applications Series). In L.L. Harlow, S.A. Mulaik & J.H. Steiger (Hrsg.), What If There Were No Significance Tests? (S. 393–425). Hoboken: Taylor and Francis. Verfügbar unter: https://meehl.umn.edu/sites/meehl.umn.edu/files/files/169problemisepistemology.pdf
Merkle, E. C., Furr, D. & Rabe-Hesketh, S. (2019). Bayesian Comparison of Latent Variable Models: Conditional Versus Marginal Likelihoods. Psychometrika, 84(3), 802–829. https://doi.org/10.1007/s11336-019-09679-0
Scharl, A. & Gnambs, T. (2019). Longitudinal item response modeling and posterior predictive checking in R and Stan. The Quantitative Methods for Psychology, 15(2), 75–95. https://doi.org/10.20982/tqmp.15.2.p075
Waning, B., Montagne, M., McCloskey, W. W. & Maki, R. A. (2001). Pharmacoepidemiology: Principles and practice. New York: McGraw-Hill. Verfügbar unter: http://www.loc.gov/catdir/bios/mh041/00045207.html
Zhang, O., Miller, D. & Cannady, M. (2011). A Model Evaluation When Associations Exists Across Testlets under Small Testlet Size Situations. Verfügbar unter: https://ouzhang.me/pdf/2011NCME2_slide.pdf